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A BELLMAN FUNCTION COUNTEREXAMPLE TO THE A x 
CONJECTURE: THE BLOW-UP OF THE WEAK NORM ESTIMATES OF 
WEIGHTED SINGULAR OPERATORS 

FEDOR NAZAROV, ALEXANDER REZNIKOV, VASILY VASYUNIN, AND ALEXANDER VOLBERG 


Abstract. We consider several weak type estimates for singular operators using the Bell¬ 
man function approach. We disprove the Ai conjecture, which is a weaker conjecture than 
Muckenhoupt-Wheeden conjecture disproved by Reguera-Thiele. 


1. Introduction 

Maria Reguera |5j disproved Muckenhoupt-Wheeden conjecture. Then Maria Reguera and 
Christoph Thiele disproved Muckenhoupt-Wheeden conjecture [6j, which required that the 
Hilbert transform would map LA^AIw) into L 1,00 (u;). It has been suggested in Perez’ paper 
HJ that there should exist such a counterexample, also [3J has several very interesting positive 
results, where Mw is replaced by a slightly bigger maximal function, in particular by Ai 2 w 
(which is equivalent to a certain Orlicz maximal function). 

Here we strengthen Reguera and Reguera-Thiele results by disproving the so called A\ 
conjecture (which also seems to be rather old and due to Muckenhoupt). The reader can get 
acquainted with the best so far positive result on A\ conjecture in the paper |lj. 

The Ai conjecture stated that the Hilbert transform would map L l (w ) to L 1,00 (w ) with 
norm bounded by constant times [w]ai (the A\ “norm” of w). Recall that [w]ai := sup ■ 
Therefore, A\ conjecture is weaker than Muckenhoupt-Wheeden conjecture, and, hence, it is 
more difficult to disprove it. And, in fact, in the A\ norm of the weight is uncontrolled, 

while we need to construct a rather “smooth” w to build our counterexample. 

The A\ conjecture is also called a weak Muckenhoupt-Wheeden conjecture. We prove that 
the linear estimate in weak Muckenhoupt-Wheeden conjecture is impossible, and, moreover, 
the growth of the weak norm of the the martingale transform and the weak norm of the 
Hilbert transform from L l {w ) into L 1,00 (w) is at least c[w\ai logs _e [w]Ai- Paper (1] gives an 
estimate from above for such a norm: it is < C [ic]ai log]^]^- We believe that this latter 
estimate might be sharp and that our estimates from below can be improved. 

The plan of the paper: first we repeat the result of [7], where the exact Bellman function 
for the unweighted weak estimate of the martingale transform has been constructed. Then 
we show the logarithmic blow-up for the weighted estimate of the martingale transform in 
the end-point case w € A\. Then we adapt this result to obtain the same speed of blow-up 
for the Hilbert transform. 
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2. Unweighted weak type of 0 shift 

Here we review the work [7], where the Bellman function and the extremizers were con¬ 
structed for the unweighted martingale transform. The unweighted problem is much easier 
than the weighted problem that we consider in the current article. However, a glance at a 
simpler problem helps us to set up a more difficult one and to understand the difficulties. So 
we start with unweighted martingale transform, and briefly recall the reader the set up and 
some of the results of |7]. 


We are on I o := [0,1]. As always D denote the dyadic lattice. We consider the operator 

ici 0 ,igd 


where — 1 < ej < 1. Notice that the sum does not contain the constant term. 
Put 

F ■= (M>J, / : = (<p)i, 

and introduce the following function: 

:= sup -j- |{x G I : ^ ej(ip, hj)hj(x) > A}|, 

jc/,jgd 


where the sup is taken over all —1 < ej < 1, J £ D, J C /, and over all ip £ L l {I) such that 
F := (|<£>|)/ , / := ( ip)i , hj are normalized in L 2 (M) Haar function of the cube (interval) /, 
and | • | denote Lebesgue measure. Recall that 


hi(x ) 


Vw ’ 

i 

Vw 


x € I + 

, x el- 


This function is defined in a convex domain Slcl 3 : H := 


{(F,/,A)€M 3 :|/|<F}. 


Remark. Function B should not be indexed by I because it does not depend on /. We will 
use this soon. 


2.1. The main inequality. 


Theorem 2.1. Let P, P +1 P_ € Q, P = (F. /, A), P+ = {F + a, / + /3, A + /?), P_ = (F- 
a, f —/3,\ —/3). Then 

(2.1) B{P)- l -{B{P + ) + B{P_))> 0. 

At the same time, if P,P + ,P- € Ll,P = (F,f,X), P + = (F + a, f + /3, A — (5), P_ = 
(F — a, f — /3, A + /3). Then 

(2.2) B(P)-±(B(P+) + B(P-))> 0. 


Proof. FixP,P + ,P_ €U,P = (F,/,A),P+ = (F+a, f + (3,\ +(3), P- = (F-a,f-P, A-/3). 
Let <p + ,ip- be functions giving the supremum in B(P + ),B(P _) respectively up to a small 
number p > 0. Using the remark above we think that <p+ is on I + and p>- is on Consider 


p(x) 


p+(x ), x € 1+ 
<p~(x ), x £ I- 
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Notice that then 
(2.3) 


O/bM 


1 




= P- 


Then it is easy to see that 

(2.4) (\<p\) I = F = P 1 , (, p)j=f = P 2 . 

Notice that for x £ I + using (12.31) . we get if e/ = — 1 

Tjrl {x£l+: Y ej{<P,hj)hj(x) > A}| = ^ \{x £ /+ : Y ej(p, hj)hj(x) > \ +/ 3 }\ 
' ' Jci,JeD ' ' jc/+,JeD 


21/4 


iTTfl {x£l+: Y ej(<P+,hj)h,j( x ) > P+, 3 }| > \ B (P + ) -77. 

jci + ,jgd 

Similarly, for x £ /_ using (12.31) . we get if ej = — 1 

Tjrl {x£l~: Y ej(<P,hj)hj(x)>\}\ = -^-\{x£l-: Y ej{v,hj)hj(x) > \- f3} \ 
' ' Jci,JeD ' ' jci +) JeD 


1 

2171 


|{* e I- 


Y ej(ip-,hj)hj(x) > P_, 3 }| > ]~ 2 B(P-) ~ V ■ 

JC/_,JgD 


Combining the two left hand sides we obtain for ej = — 1 


Tjrl {x£l+- Y £j(v,hj)hj(x) > A}| > ^(B(P + ) + B(P_)) - . 

1 1 Jci,jeD 


Let us use now the simple information ()2.4I) : if we take the supremum in the left hand side 
over all functions ip, such that (|<^|)i = F, (p)j = /, and supremum over all ej £ [—1,1] (only 
e/ = — 1 stays fixed), we get a quantity smaller or equal than the one, where we have the 
supremum over all functions p, such that (|^|) = F, (p)j = /, and an unrestricted supremum 
over all ej £ [—1,1]. The latter quantity is of course B(F, /, A). So we proved (12. 1 j) . 

To prove (12.2ji we repeat verbatim the same reasoning, only keeping now e/ = 1. We are 
done. 

□ 


Denote 

T p-= Y ■ 

jci,j£D 

It is a dyadic singular operator (actually, it is a family of operators enumerated by sequences 
of ej £ [—1,1]). To prove that it is of weak type is the same as to prove 

(2.5) 5(F,/,A)<—,A> 0. 

Our B satisfies (|2.ip . (|2.2|) . We consider this as concavity conditions. 

Let us make the change of variables, (F, /, A) —>• ( F , 2 / 1 , 2 / 2 ) - 

Vi ■= \ j(A + /), V2 := ^(A — /). 

Denote 

M(F , 2/15 2 / 2 ) := B(F,yi - y 2 ,yi + y 2 ) = B(F, f, A). 

In terms of function M Theorem o reads as follows: 
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Theorem 2.2. The function M is defined in the domain G := {(F, 2 / 1 , 2 / 2 ) : \yi ~ 2 / 2 1 < F}, 
and for each fixed 2 / 2 ? M(F, yi, •) is concave and for each fixed y%, M(F, 7 / 2 ) is concave. 

Abusing the language we will call by the same letter B (correspondingly, M ) any function 
satisfying (12.11) . (12.21) (correspondingly satisfying Theorem 12.21) . 

It is not difficult to obtain one more condition, the so-called obstacle condition: 

Lemma 2.3. 

(2.6) If X < F then B(F, f, A) = 1. 

Proof. Let us first consider the case f = F, which can be viewed as the case of non-negative 
functions <f. Fix Ao and e > 0, let p be a non-negative function on I = [0,1] such that it 
looks like (Ao + e)5o, and F = f := f 0 pdx = Xo + e > Xq. Namely, ip is zero on the set of 
measure 1 — r, and an almost 5 function times Ao + e on a small interval of measure r. 

As it looks as a multiple of delta function, it can be written down as Ao + e + H, where H is 
a combination of Haar functions, and a martingale transform of cf, namely, — H = Ao + e > Ao 
on a set of measure 1 — r with an arbitrary small r (the smallness is independent of Ao and 
e). Then the example of p shows that 

B( Ao + e, Ao + e, Ao) > 1 — t . 

We have to consider the case of / < F as well. If / > Ao, the construction is the same. 
Namely, consider := p + aS, where S is a Haar function with very small support in a small 
dyadic interval I (say, of measure smaller than r) and normalized in L 1 , let I be contained in 
the set, where p is small (p is small essentially on almost the whole interval, because it looks 
like a positive multiple of the delta function), and ensure that f S dx = 0, and f \S\dx = 1. 
Then the example of p shows that 

B( f |<h| dx, Ao + e, Ao) > 1 — 2r . 

Jo 

By varying a from 0 to 00 we can reach f |<J?| dx = F for any F > Ao + e. Therefore, making 
first r —>• 0 and then e —» 0, we prove (12.61) . 

We are left to consider the case F > Ao > /. Choose e and r much smaller than, say, 
jq(F — /). Consider the same function p, as above. Let H be the first Haar function, namely 
H = — 1 on I- = [0,1/2] and H = 1 on I + = [1/2,1], Let us consider now if := p + c\ H — C 2 , 
ci > C 2 > 0. Then 

(if) = A 0 + e - c 2 , (\if\) = A 0 + ci -I- 0(t). 

It is easy now to choose ci, C 2 such that the first average above is equal to a given number /, 
and the second one is equal to a given F, F > /. Now on the set E of measure 1 — r we have 
if = c\H — C 2 - On the other hand if = Ao + e + c\H — C 2 + Hi, where H\ is a combination of 
Haar function, each of which is orthogonal to H. 

Hence, —Hi = Ao + e > Ao on E of measure 1 — r. But —Hi is the martingale transform 
of if in our sense. In fact, we just consider the Haar decomposition of if, forget the constant 
term, and multiply all Haar coefficients on —1 except the first one, which is got multiplied 
by 0. 

We obtain that B(F, f, Ao) > 1 — r. We are done. 


□ 
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Theorem 2.4. Let B > 0 satisfy (12.11) . (12.21) . (Equivalently, let the corresponding M > 0 be 
concave in (F,y i) and in (F,y 2 ).) Let B satisfy (12.51) . or, equivalently, 

C F 

(2.7) M(F,yi,y 2 ) <---, yi + y 2 > 0 . 

Vi + 2/2 

Let B(F,f,X) = 1 if X < 0. Then we have the weak type estimate with constant at most C 
for all T uniformly in ej G [—1,1]. 

Proof. Just by reversing the argument of Theorem 12.11 

□ 


Remark. Notice that the Bellman function B defined above satisfies by definition B(F, f, A) = 
B(F, — f, A). Therefore, Lemma 12.31 claims in particular that B(F, /, A) = 1 if A < 0 (and we 
saw that it also satisfies (12.11) . (12.21)1 . 

Here is the Bellman function for unweighted weak type inequality for martingale transform, 
see [7]. 

Theorem 2.5. 

fl, if X < F, 

(2 ' 8) = 

in 0 this formula was found by the use of Monge-Ampere equation. As always in stochastic 
optimal control related problems (and this is one of such, see the explanation in |3|) one needs 
to prove that the solution of Bellman equation is actually the Bellman function. This is called 
“verification theorem”, and it is proved in [7] as well. 

3. Weighted estimate. Ai case 

We keep the notations-almost. Now w will be not an arbitrary weight but a dyadic A± 
weight. Meaning that 

Vleh (w)j < Q inf w . 

The best Q is called [u;]^. Now 

F = (I f\w)i, f = (f)i, X = A, w = (w)i,m = inf w . 

We are in the domain 

(3.1) Ll := {(F, w, m, /, A) : F >\f\m, m<w<Qm}. 

Introduce 

(3.2) M(F, w, m,f,X) := sup T 77 w { x € I : ^ hj)hj(x) > X} , 

' ' jci,jgd 

where the sup is taken over all ej, |ej| < 1, J G D, J C /, and over all / € L l {I,wdx) such 
that F := (|/| w)j , / := (/)/, w = (w)i,m < inf / w, and w are dyadic Ai weights, such that 
V/ € D (w)j < Qmfjw, and Q being the best such constant. In other words Q := [w\ d ^ dlc . 
Recall that hi are normalized in L 2 (M) Haar function of the cube (interval) /, and | • | denote 
Lebesgue measure. 
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3.1. Homogeneity. By definition, it is clear that 

sE(F/s, w/s , m/s, f, A) = B(F, w, m, f, A), 

M(tF, w, m, tf, tX) = B (F, w, m, f, A). 

Choosing s = m and t = A -1 , we can see that 

F f 

(3.3) M(F,w,m,f, A) = mB{ — 

rn\ m A 

for a certain function B. Introducing new variables a = f3 = ^,7 = t we write that B 
is defined in 

(3.4) G := {(a, /?, 7 ) : M < a. 1 < P < Q} ■ 

3.2. The main inequality. 

Theorem 3.1. Let P,P + ,P € f l,P = (F,w,mm(m + ,m-),f,X), P + = (F + a,rc + 
-f,m + ,f + f3,X + l3), P_ = (F - a,w - 7 ,m_,/ - /?, A - /3). Then 

(3.5) B(P)-i(B(P+)+B(P_))>0. 

At the same time, if P, P + ,P_ € Q,P = (F, w, min(m + , m_), /, A), P + = (F + a, w + 
7 ,m+, / + fi, X — /3), P_ = (P - a,w + 7 , m + , / — j3, X + /3). Then 

(3.6) B(P)-i(B(P+)+B(P_))>0. 

/n particular, with fixed m, and with all points being inside Ll we get 

B(P, w, m, f, X) — ~(M>(F—dF,w—dw, m, f—dX, X — dX)+M(F—dF,w—dw, m, f+dX, X — dX)+ 

(3.7) B(P + dF, w + dw, m, f — dX, X + dX) + B(P + dF, w + dw, m, f + dX, X + dX)) > 0 . 

Remark. 1) Differential notations dF,dw,dX just mean small numbers. 2) In (13.71) we loose 
a bit of information (in comparison to (13.51) . (13.61) b but this is exactly (13.71) that we are going 
to use in the future. 


Proof. Fix P, P + , P_ € Ll. Let (p+, (/?_, w+, w— be functions and weights giving the supremum 
in P(P + ),P(P_) respectively up to a small number rj > 0. Using the fact that B does not 
depend on I, we think that ip + ,w + is on I + and is on Consider 


ip(x) 


T+(x), x G 1+ 
ip~(x ), x € /_ 


Notice that then 

(3.8) 

Then it is easy to see that 

(3.9) 


^ ): |w + (x),^/ + 

I W-(x), x G i_ 

(|^|£V)/ = P = Pi, <¥?)/ = / 


P4. 
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Notice that for x G /+ using (13.81) . we get if ej = — 1 


\I\ 


w + {x G /+ : E e/(<£, hj)hj(x) > A} = w + {x G 1+ : ^ ej(<£, hj)hj(x) > X+/3} 


jci + ,jgd 

= fiTfl w+{x€l+: Y ej{ip + 1 hj) h j{x) > P+, 3 } > \ B( y P +) ~V- 

' jci + ,J£D 

Similarly, for x G /_ using (13.81) . we get if ej = — 1 

€ /- : ^ ej(if,hj)hj(x) > A} = ^—w-{x G /_ : Y ej(tp,hj)hj(x) > X-fi} 

' ' jci,jgd ' ' j<zi-,JeD 

.{xG/_: Y £j{V-,hj)hj{x) > P_ 3 } > ^P(P-) - 77 . 


1 


2 /_ 


r uM 


JQI-J&D 

Combining the two left hand sides we obtain for ej = — 1 
1 




w{x G/+ : ^ ej(tp,hj)hj(x) > X} > ]-(B(P + ) + P(P_)) - 2rj. 


JCI,J+D 

Let us use now the simple information (|3.9I) : if we take the supremum in the left hand side 
over all functions ip, such that (\ip\w)i = F, (ip)i = /, (u) = w, and weights c 0 : (u) = w, in 
dyadic A\ with A]-norm at most Q, and supremum over all ej = ±1 (only ej = —1 stays 
fixed), we get a quantity smaller or equal than the one, where we have the supremum over all 
functions (p, such that (|</?| u) = F, (p)j = /, (u) = w, and weights u: (u) = w, in dyadic A\ 
with Ai-norm at most Q, and an unrestricted supremum over all ej = ±1 including e/ = ±1. 
The latter quantity is of course B(F, w, m, /, A). So we proved (12.11) . 

To prove (12.21) we repeat verbatim the same reasoning, only keeping now e/ = 1. We are 
done. 

□ 


Remark. This theorem is a sort of “fancy” concavity property, the attentive reader would see 
that (13.50 . (13.61) represent bi-concavity not unlike demonstrated by the celebrated Burkholder’s 
function. We will use the consequence of bi-concavity encompassed by (|3.7I) . There is still 
another concavity if we allow to have |ej| < 1 . 


Theorem 3.2. In the definition of B we allow now to take supremum over 
P, P+, P- € Ll, P = (F, w, m, /, A), P + = (F + a, w + 7 , m, / + /3, X), P_ 
7 , m, f — /5, A). Then 


all \ej\ < 1. Let 
= (F — a, w — 


(3.10) 


B(P) - i(B(P+) + B(P_)) > 0 . 


Proof. We repeat the proof of (|3.5I) but with ej = 0. 


□ 


Theorem 3.3. For fixed F, w, /, A function B is decreasing in m. 

Proof. Let m = min(m_,m+) = m_. And let m + > m. Then ()3.5I) becomes 

B(F, w, m, /, A) — B(P, w, m+, /, A) > 0 . 


This is what we want. 


□ 
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3.3. Differential properties of B translated to differential properties of B. ft is 

convenient to introduce an auxiliary functions of 4 and 3 variables: 

B(x, y,f, A) := 

Of course 

(3.11) B(F, w,m,f, A) =mB(—,—,f, A) = mB{^—, —, {) • 

m m mX m A 

Lemma 3.4. Function B increases in the first and in the second variable. 


Proof. We know that by definition the RHS of (13.111) is getting bigger if A is getting smaller. 
So let us consider Ai > A 2 , Ai = A 2 + 5, and variables F, w, m, f fixed, and choose (and a 
weight uj), {(j>i) = f+e, (|0i|u;) = F, which almost realizes the supremum B (F, w, m, f+e, Ai). 
Consider 2 such that 02 = 0i — h. Function h will be chosen later, however we say now that 
h is equal to a certain constant a on a small dyadic interval i and is zero otherwise. Constant 
a and interval f we will chose later. But e := (h) will be chosen very soon. Function 2 
competes for supremizing B at ((\<f> 2 \u), w, m, f, A 2 ). We choose e in such a way that 


(3.12) 


iff 1) = / + e 
Ai Ai 


Let us prove that (13.121) implies that 


/ 

Ar - 5 


( 02 )_ 

A 2 


(3.13) 

By (13.121) this is the same as 


Ai A 2 


(1^2 + h\u>) < (0l) = (02) + £ 

(102 |w) “ (0 2 ) (02) 

The previous inequality becomes 

<102 + h\u) Jfi)_ 

(|02|w) _ (0 2 ) ' 

By triangle inequality the latter inequality would follow from the following one 

We can think that the minimum ra of w is attained on a whole tiny dyadic interval l (we 
are talking about almost supremums). Put h to be a certain a > 0 on this interval and zero 
otherwise. Of course we choose a to have (h) = e, where e was chosen before. Now the 
previous display inequality becomes 

(| 02 | w) > (0 2 ) ■ m, 

which is obvious. 

Notice that B((|</> 2 |), w, m, f, A 2 ) as a supremum is larger than the w-measure of the level 
set > A 2 of the martingale transform of <f> 2 - But this is also the martingale transform of (pi. 
The A 1 -level set for any martingale transform of is smaller, as Ai > A 2 - But recall that we 
already said that 6 1 (and weight u) almost realizes its own supremum B (F, w,m, f + e, Ai) = 
®((|0iI), it ;, m , ( 01 ), Ai) So 

B((|0i|),u;,m, (0i), Ai) < B(<|0 2 |>, w, m, (0 2 ), A 2 ) ■ 




















Ai CONJECTURE 


9 


In other notations we get 

(|^l|) W (^i) . <1021) W QfoK 

l mAi ’ m’ Ai — l mA 2 ’m’ A 2 

Let us denote the argument on the LHS as (xi,yi, z\), and on the RHS as (x 2 ,y 2 , £ 2 )- Notice 
that yi = y 2 ='■ U trivially and z\ = Z 2 =: 2 by (13.121) . Notice also that x\ < X 2 by (13.131) . 
Moreover by choosing 5 very small we can realize any x\ < X 2 as close to X 2 as we want. 
Then the last display inequality reads as 

B(x!,y,z) < B(x 2 ,y,z). 

So we proved that function B increases in the first variable. 

The increase in the second variable is easy. Choose a dyadic interval I on which inf / uj > m, 
but (uj)i/ inf/ uj < Q =: [w]^. For non-constant ui this is always possible, just take a small 
interval containing a point xo, where uj(xq) > m. Then augment won / slightly to get ui\ 
with (cui) = w + e. It is easy to see that as a result we have the new weight with the A\ norm 
at most Q, the same global infimum m but a larger global average (uj). The uj\ measure of 
the level set of the martingale transform will be bigger than uj measure of the same level set 
of the same martingale transform, and w/m also grows to (w + e)/m . All other variables stay 
the same. So if the original uj (and some </>) were (almost) realizing supremum, we would get 

B(x,yi,z) < B(x,y 2 ,z) 

for y\ = w/m, y 2 = (w + e)/m. □ 


Theorem 3.5. Function B from (13.31) satisfies 

(3.14) + ' +_1 

(3.15) 


(3.16) 


t —> t L B(at,/3t, 7 ) is increasing for — < t < ^ 


T3( X /n 1 

^ A _ 4 


' x-dx 

B( xZdX’ y - dv 


B is concave 
f-dX 


X-dX 


x-dx f + dX 

) + B (~x - TT) V ~ dy , T - ry) + 


X — dX 


x + dx f-dX x + dx f + dX s 

B ( ^ , j^ y + dy, . , ,. ) + ^( x , _ 7 , ,y + dy, 


'X + dX 


X + dX' 


'X + dX 


X + dX' 


X-dX' 
> 0 . 


Proof. These relations follow from Theorem 13.31 Theorem 13.21 and Theorem 13.11 (actually 
from (13.71) 1 correspondingly. □ 


We can choose extremely small £0 aR d inside the domain 11 we can mollify B by a convo¬ 
lution of it with eo-bell function supported in a ball of radius £o/ 10 . 

Multiplicative convolution can be viewed as the integration with where 5 = 

£o/10. Here xq is a point inside the domain of definition 11 for function B. 

This new function we call B again. It is exactly as the initial function B, and it obviously 
satisfies all the same relationships, in particular it satisfies Theorems 13.1113.2113.31 Only its 
domain of definitionl2 eo is smaller (slightly) than H. The advantage however is that the new 
B is smooth. We build B by this new B. A new function B defined by the new B as in (13.111) 
will be smooth. Actually the new B should be denoted B e ° , where superscript denotes our 
operation of mollification, but we drop the superscript for the sake of brevity. In fact, all these 
mollifications are for the sake of convenience, the new functions satisfy the old inequalities 
in the uniform way, independently of £q. Property (I3.16P can be now rewritten by the use of 
Taylor’s formula: 
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Theorem 3.6. 


(dx d\ 


OL Baa ( . 

\ X X 


- P 2 Bpp 



-(1 + 7 2 )B 


„ „ / dx dX \ dy „ ^ dy dX „ 

- 2a/3B a p i— - y ) ~ + 2 fB/ B 8 y — + 2oqB, 


77 ^ 

dx 

x 


dX 


d\\ dX 

tJt + 


+2 otB a 


dx 

x 


dX\dX 


X X 


2 yB. 




□ 


Proof. This is just Taylor’s formula applied to (13.161) . 

Denoting 

dx dy dX 

x y ' ^ X 

we obtain the following quadratic form inequality 

Theorem 3.7. 

— p 2 [a 2 Baa + P^Bpp + 2oipB a p\ — y 2 \orB aa + (1 + i 2 )Byy + 2oLiB a y + 2uB a + 2 yBy\-\- 
+2 fy [o?B aa + a PB a p + PlBpy + a'yBay + o.B a ] + 0 . 

Now let us combine Theorem 13.71 and Theorem 13.21 In fact, Theorem 13.21 implies 

—2ayB a yy 2 < —a 2 iB a ay 2 — 7 Byyy 2 . 

We plug it into the second term above. Also Theorem 13.21 implies 

2a^Bayfy < -a 2 iB aa p 2 ~ iByyfi , 

fifiBpypy < -$ 2 lBppf 2 - yByyr) 2 , 

We will plug it into the third term above. Then using the notation 

if(a,P,i) := -a 2 Baa ~ 2 af3B a p - P 2 Bpp 


(which is non-negative by the concavity of B in its first two variables by the way) we introduce 
the notations 

K ■= if (a, P, 7 ) + {-a 2 Baa ~ P 2 Bpp)i , 

L \= -if {a, ( 3 , 7 ) + ( a 2 B a )a - P 2 Bpp , 

N := -(1 + 37 + 7 2 )Byy - 27 By - (a 2 Ba)a - a 2 Baal ■ 

And we get that the following quadratic form is non-negative: 

£ 2 K + in L + n 2 N := 
f 2 [if (a, P, 7 ) + {-a 2 Baa ~ P 2 Bp 0 ) 7 ] + 
fy [-if {a, P, 7 ) + (a 2 B a ) a ~ P 2 Bp 0 }+ 

I ] 2 [-(1 + 37 + l 2 )Byy - 27 By - (q 2 B a ) a ~ a 2 B a al\ > 0 . 

Therefore, K is positive, and 

(3.17) N>fi. 

Now we will estimate L from below, K from above and as a result we will obtain the 
estimate of N from below, which will bring us our proof. 





Ai CONJECTURE 


11 


But first we need some a priori estimates, and for that we will need to mollify B = B e ° in 
variables a, (3. Again we make a multiplicative convolution with a bell-type function. Let us 
explain why we need it. Let 

Q := supL>/a. 

G 

We want to prove that 

(3.18) Q/Q^oo. 


First we need to notice that 

(3.19) [ ip {at, j3t , 7 )dt <C (Qj + ^-a), ip (a, f3, 7 ) := -a 2 B aa - 2 a/3B a/3 - /3 2 Bpp . 

•/ 1/2 Q 

In fact, consider (3 € [Q/4, Q/2], b(t ) := B(at,/3t, 7 ) on the interval 77 =: to < t < 1. Let 
£(i) = b(l)t < Qta. We saw that b(t)/t is increasing and b is concave, and b is under £, and so 
by elementary picture of concave function having property bp)/- increasing and bp) concave 
on the interval [t' 0 , 1 ] we get that the maximum of £(■) — b(-) is attained on the left end-point. 
The left end-point t' 0 is the maximum of 1 0 = \'y\/ot and 1//3 which is c/Q. Therefore, 

£{t) - b{t)\{t = (max(—, ^)) < f(max(-, ^)) < CQa max(-, ^-) < Q7 + Q-a , 
a Q a Q a Q Q 

and the above value is maximum of g{t ) := £{t) — b(t) on [tg, 1]. By the same property that 
b(t)/t is increasing we get that 

g'(l)=£'(l)-b'{l) = b{l)-b'(l)<0. 


Combining this with Taylor’s formula on [to, 1] we get for g := £ — b {g is convex of course): 


(3.20) —{l — to)g'{l)+[ dt f g"{s)ds = positive + f {s —to)g"{s)ds < supg < Q'y+^-a . 

Jt 0 Jt Jt 0 Q 


This implies (13.191) because g"(t ) = -pip(at, f3t, 7 ), t € [1/2,1], 

Consider now function apt ) := B(at,/3, 7 ) We also have the same type of consideration 
applied to convex function Qa — apt) bringing us 


(3.21) 

Similarly, 



—a 2 B aa (at, (3, 7 ) dt < CQa . 


(3.22) 



a, f3t, 7 ) dt < CQa. 


We used here that B a > 0, Bp > 0, which is not difficult to see. 

For the future estimates we want ([3.191) . (13.211) . ([3.221) to hold not in average but pointwise. 
To achieve the replacement of “in-average” estimates ([3.191) . (13.211) . ([3.221) by their point- 
wise analogs let us consider yet another mollification, now it is of B: 

B n ew(a,P, 7 ) := 2 / B(at,/3t,j) dt. 

J 1/2 
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The domain of definition of B new is only in tiny difference with the domain of definition of B. 
In fact, the latter is {(a,/3, 7 ) : I 7 I < a, 1 < /? < Q}, and the former is just G := {(a, /3, 7 ) : 
|7| < \a ,2 < ft < Q}. 

If we replace (a,/3,'y) by (at,/3t, r /), 1/2 <t< 1, everywhere in the inequality of Theorem 
13.71 and then integrate the inequality with 2 j^ 2 ... dt, we will get Theorem Ef] but for B new . 


It is not difficult to see that (13.191) becomes a pointwise estimate for B new (just differentiate 
the formula for B new in a,/3 ,7 and multiply by a,/3 ,7 appropriately): 


(3.23) - a 2 (B new ) aa - 2a/3(B new ) a/3 - p 2 (B new )pp < C(Q 7 + ^ a ). 

This pointwise estimate automatically imply new “average” estimate: 

2 [ ( - a 2 s 2 (B new ) aoi (as,/3,'y) - 2as/3(B new ) a/3 (.,/3 ,.) - /3 2 {B new )pp) < C(Qj + ^a). 

J 1/2 V 

This means exactly that the function 

B .— (B new ') new .— 2 / Bias, (5, 7 ) ds 
J 1/2 

still satisfies (13.231) . It also clearly satisfies the inequality of Theorem 13.71 because (as we 
noticed above) B new satisfies this inequality. To see this fact just replace all a’s in the 
inequality of Theorem 13.71 applied to B new by as and integrate 2 f^ 2 ... ds. 

Now let us see that B = ( B new ) new also satisfies a pointwise analog of (13.211) . namely, that 

(3.24) — a 2 B aa (a, /3, 7 ) < CQa . 


To show (13.241) we just repeat what has been done above. Let g(t ) := Qa — B new (at, (3, 7 ). 
Then we have: 1) 0 < g < Qa on [ffi, 1], 2) </(l) < 0 (we saw that B, and hence B new , are 
increasing in the first argument), 3) g is convex. Then we saw in (13.201) that 

f s 2 g"(s) ds < f g"(s)ds<CQa. 

J1/2 J 1/2 

But this is exactly (13.241) . 

So far we constructed a function B = ( B new ) new that satisfies pointwise inequalities (13.231) . 
(13.241) and the inequality of Theorem 13.71 We are left to see that by introducing 

B := 2 f B(a, /3s, 7 ) ds 
J 1/2 

we keep (I3.23|) , (13.241) and the inequality of Theorem 13.71 valid and also ensure 
(3.25) — P 2 B^/ 3 (a, /?, 7 ) < CQa . 


W already just saw that (13.231) . (|3.24l) and the inequality of Theorem 13.71 are valid for B 
just by averaging the same inequalities for B. We can see that (13.251) holds by the repetition 
of what has been just done. Namely, consider g(t ) := Qa — B(a, /3f, 7 ). Then we have: 1) 
0 < g < Qa on [to, 1], 2) (/(l) < 0 (we saw that B, and hence B new , B are increasing in 
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the first argument), 3) g is convex. Using (13.201) again in exactly the same manner as we did 
with proving (|3.24l) we get 



g"(s) ds < CQa. 


But this is exactly (13.251) . 


We drop “hat”, and from now on B is just denoted by B. We can summarize its properties 


as follows. 


(3.26) 

0 < p>(a, P, 7 ) < C(Q 7 + Qa] 

(3.27) 

0 < — a 2 B aa (a, p, 7 ) < CQa 

(3.28) 

0 < ~p 2 Bpp{a,P, 7 ) < CQa 


Recall that (now with this mollified B): 

f K+ ^r]L + r] 2 N := 


f P, l) + (~a?B aa - P 2 Bpp) 7 ] 


ig [-ip(a, /3, 7 ) + (a 2 B a ) a - /3 2 j 
rj 2 [-(1 + 3y + 7 2 )R 77 - 27 R 7 - ( a 2 B a ) a - a 2 B aQ 7 ] > 0 . 

We will choose soon appropriate 0 : 0,07 < ^gOo and 7 < too with some small r. Let us 


introduce 


rao rao 

k := K da = [ 

J a\ J a 1 


(a, ft, 7 ) + ( -a 2 B aa - /3 2 Bpp) r y\ da , 


n := 


/»Q;o ^cxo 

:= / N da = [—(l + ^ r y + 'y 2 )B^ — 2^B^ — (a 2 B a ) a — a 2 B oia 'y]da 1 

J a 1 */ cki 

rao 

£:= [-i/>(a,^,7) + {a 2 B a ) a - p 2 Bpp\da. 

J ai 

Estimate of k from above. The integrand of k is obviously positive and if> term dominates 
other terms (by (13.261) . (J3.27I) . (|3.28l) and the smallness of 7 ). Therefore, 


(3.29) 0 < k < Ci {Q^/a® + C-a^) + C 2 Q7 q o — C {Ql a ® + > 

if Q is very large. We choose (we are sorry for a strange way of writing 00 , why we do that 
will be seen in the next section) 


(3.30) 


«o 



1 I Q_ a 
100 y Q a ° 


Here c is a small positive constant. We also choose to have 7 running only on the following 
interval 


(3.31) 


7 € [ 0 , 70 ] , 70 := r 



p 

« 0 , 


P = 


1 , 


where r is a small positive constant. 
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Estimate of £ from below. Estimating from below we can skip the non-negative term 
—/3 2 Bpp. Also 

rao - ,Q - 


r<x o 

/ /3, 7 ) > -CQ~/a 0 - C 

J OL\ 


Q 


-(y.r\ . 


On the other hand, 


rot 0 

/ < 

J a 1 


(■ a 2 B a ) Q da > alB a (a 0 , /?, 7 ) — afQ , 


as mollification gives a pointwise estimate 


(3.32) B a <CQ. 

Recall that /3 € [Q/4, Q/2}. We also will prove soon the obstacle condition (13.451) . which 
says that 


(3.33) B( l,/3, 7 ) > f - 

If B a (ao, Pjj) would be smaller than Q /40 (and then B a (s,f3, 7 ) < Q/40 for all s € [ao, 1] 
by concavity of B in its first variable) we would not be able to reach at least ^L. In fact, by 
our choice of ao in (13.301) we have 

(3.34) R(a 0 ,/3,7) < Qa 0 < cQ. 

If B a (ao, 7 ) < and so this derivative B a (s,/3, 7 ) < | on s £ [ao, 1] (concavity), we 
cannot reach Q/( 4-8) for s = 1 if we start with value of B in (13.341) at s = a q. But the fact 
that we cannot reach Q/(4 • 8 ) contradicts to (13.331) . Therefore, 

(3-35) B a (a 0 ,/3, 7 ) > ^ , 

and 

(3.36) £> - a 2 Q - C Q^a 0 - C^a 2 . 

As a\ = T5o a o \[^ ( see ^3.301) 1. the second term is dominated by the first; the third term 
is dominated by the first because of the choice of 70 in (13.311) . the fourth term is dominated 
by the first one because Q 2 » Q , see [4j for a much better estimate. 


Finally, 

(3.37) 

And k is 

We got 

(3.38) 


2 

£ > >calQ. 

0 < k < C (Qja 0 + C^al) = a 0 Q (7 + ^«o) • 

f 2 alQ 2 

n > — > c --—----. 

4k at 0 Q (7 + ^a 0 ) 


Estimate of n from above. By (j3.35l) . (|3.32l) and (13.271) we get 
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fa. o rcto 

/ —(a 2 B a ) a da — 'j / ot 2 B aa da < — cQcxq + CQa\ + cQ«o7 < 0 . 

J a i J an 


t Ot .1 »/ Oil 

Negativity is by the choice of a\ in (13.3011 and by the fact that 


(3.39) 


7 < c, 


which is much overdone in (13.311) . 

Therefore, we get, combining with (13.3811 (here 77 > is an absolute constant and it is at 
least the maximum of all our 3y + y 2 ) 




Qh + T 3 a o) 


< n < - 


r a o _j_ 2 

(1 + rj) (e 1 +') 7 S 7 ) 7 da, 
J CKl 


or 

(3.40) 


r a ° i 2 

/ (—e 1 + ? J 7 Bj)jda>C- 

J CXl 




'«1 Q (Qi + «o) 

Function B is smooth, concave in 7 and symmetric in 7 (the latter is by definition). In 
particular By(a,/3, 0) = 0. So after integrating in 7 on [ 0 , 7 ], 7 < 70 we get 

f a 0 Q 2 (02 Q 

(3.41) / (-5 7 )da > C'ao- 7 -[log(a 0 + Q 7 )-loga 0 ] = Cao-^-log(l + — 7 ). 

Jai Q Q a 0 


Integrate again in 7 on [ 0 , 70 ]. We get the integral over [07,07] of the oscillation of B, 
which is 

r a 0 

/ [B(a,f3,0) - B(o,/3,7o)]do > Cajj-i- ■ ^(1 + Q— )log(l + Q —). 

./a! Q Q «o «o 

But this oscillation is smaller than CQot q. We get the inequality 


(3.42) 


C (1 + Q—) log(l + Q—) < a 2 Q . 
Q ot 0 o 0 


Notice that 00 , 70 , 70/07 are all powers of 2, which we expect to be a sort of 

Then we get the estimate in terms of powers of S: 

w 


(3.43) 


C 00777— l°g(l + Q —) < 1 • 

Q 2 ot 0 oq 


Let us count the powers of 2: Oq brings power 2— -by (13. 3011 . ^ brings power 1 by (|3.31l) . 
so totally we have (- log Q) 5 P log in the left hand side. 

We can see that if Q < Q\og p Q with p < then (13.43P leads to a contradiction. So we 
proved 

Theorem 3.8. The weighted weak norm of the martingale transform for weights w € A dyadlc 
can reach c \vj\a-i log p [?u] / 4 1 for any positive p < 1/5. 
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3.4. A small improvement: from 1/5 to 2/7. Suppose that we are allowed to transform 
the martingale not just by ej = ±1 but by any \ej\ < 1 (it is not clear whether this is the 
same for weak norm estimate, probably yes). The change will give us that d\\ < \df \, and 
this will mean in articular, that we automatically have that function B(a,f3, 7 ) is concave in 
7 . We observed that it is symmetric in 7 . Together this gives us that 


(3.44) 


B(a, /3, 7 ) < B(a,/3, 0), I 7 I < a; H(a,/3, 7 ) > B(a,/3, 0)/2, I 7 I < a/2. 




Now to improve the constant 1/5 we consider Q + := y QQ. We put 

Q 

ao ■— Ci ~q+ >> a o- 


Two cases appear: 

Case 1. B(ao,(3,0) < Q + clq. Then we replace ao by ao in (13.30jh we replace 70 by 70 = 
a o(-^r) P hi (I3.3ip . and we got (13.341) with B(ao,/3,'y ) < cQ (we use (13.441) here). And a 

result we have (exactly by the same reasoning as above) -B Q (ao,/3,7) > |jj. This the same as 
(|3.35l) but with ao instead ao- Now the main bookkeeping inequality (I3.43|) with ao replacing 
of ao, 70 replacing 70 , gives us a new p = 2/7. 


Case 2 . L>(ao,/3,0) > Q + ao. Then L>(ao,/3,0) > c\Q. And by (13.441) L>(ao,/3,7) > c\Q 
if I 7 I < ao/2. But we saw that B(ao,/3,7) < cQ. Then between ao and a 0 there is a 
point do such that _B a (do,/3,7) > C 2 Q/{c\Q/Q + — cQ/Q) > c^Q + . Then by concavity 
-B Q (ao,/?, 7 ) > C 3 < 5 + . This is exactly (13.351) . but with a bigger constant in the right hand 
side ( Q + in place of Q). 

Therefore we can repeat verbatim the whole body of estimates after (|3.35l) up to the main 
bookkeeping inequality (13.431) . However, in (13.431) Q 2 in the numerator should be replaced 
by (Q + ) 2 . Calculating p we are able again reduce it to 2/7. 

3.5. Obstacle conditions for B. Now we want to show the following obstacle condition 
for B, which we already used: 

(3.45) ^ l7l < then 5(1,/?, 7 ) > ^ . 

Let I := [0,1]. Given numbers |/| < A/4, ^ = A it is enough to construct functions p, if}, w 
on / such that 

Put ip = —a on I _, = b on /++, zero otherwise. And in = 1 on I _U I++, and w = Q 

otherwise. Then put 

1 / := (tp, h/_)/i/„ - (p,h I+ )h I+ . 

Let 0 < a < b and a is close to b. Put A = (a + 6)/4. Then average of p is small with respect 
to A and we can prescribe it. F = (a + 6)/4, m = 1. On the other hand, function 1 / (which is a 

martingale transform of p— (p)) is at least — ( p , hj + )hj + >\b> A on 1 4 , whose in-measure 

is more than ^w(I). So 

(3-46) -B(l,/?, 7 ) > i/3, 

for all small 7 and /3 x Q. This is what we wanted to prove. 
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4. Bellman function and the estimate of weighted weak norm from above in 

A\ CASE 

Let us denote by N k the quantity (w is constant on k-th generation and re € A^ yadtc ) 

1 


N k {V) : = sup —i w{x <E / : 


Y ej(f,hj)hj> A}. 

JCl,J£D,\J\>2~ k \I\ 

Then we have practically by the definition of N k (let V temporarily denotes vector (F, /, A, w, 
and yi := A + /, y 2 ■= A — /) 

(41) _ N k (V-) + N k (V + ) 


N k+1 (V) < sup 

v+,v~,v= V++ 2 V ~ ,\vi+-yi-\=\y 2 +-y 2 -1 


In this language we need to prove that 

(4.2) N k {V) < B{V) for any k and any V e Q k . 

By bi-concavity of B and by (14.11) we immediately see the induction step from k to k + 1. 
We are left to check that 


(4.3) N 0 (V) < B{V ). 

Let us check (14.31) . If A > ^ > |/| we just use B(V) > 0 because for such parameters 

\(f,h [0jl] )\<\f\<X 

and the subset of [0,1], where e[o,i](/> ^[o,i])^[o.i]( x ) is greater than A is empty. 

On the other hand, if A < —, what can be the largest tc-measure of E C [0,1] on which 
e [o,i](/> h[o,i])h[o.i](%) > A? Here is the extremal situation: w is 2 Q — 1 on [0,1/2], and 1 on 
[1/2,1], Function (p is zero on [0,1/2], and constant 2/ on [1/2,1], Then F = /, m = 1 (these 
are data on [0,1]). On the other hand, 

F 

e [o,i](VL tyo,i])tyo,i ]( x ) = t[ 0 ,i] f h[o,i]{x) = — > A 

on the whole [0,1/2] if epq] = ± is chosen in the right way. But in this case again, B(V) > 
2Q- 1 > wj([0, 1/2]). 

Hence, B(V) > Nq(V) is proved, and we can start the induction procedure. 

It is left to find our B to have a sharp estimate from above in A\ problem. 


5. Martingales 


We will use four-adic lattice T. For a four-adic interval I let Hj = 1 on its right half, 
Hi = — 1 on its left half, let also Gj = 1 on its leftest and rightest quarters and Gj = — 1 on 
two middle quarters. We will call martingale difference the function of the type 

fn= Y ajHl 

i&r,i{i)=4~ n 


or 

9n= Yj k/G/, 

ieT,i{i)= 4 -« 

where aj,bj are numbers. Martingale for us is any function on Iq := [0,1] of the type 
f = / + J2n= o fn,org = g + £n=0 g n , where /, g are two constants. We distinguish H- and 
G-martingales. 

In the previous sections the following theorem was proved. 
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Theorem 5.1. Given Q > 1 there exist three H-martingales F,f, w, F > 0, w > 0, and one 
G-martingale g = g + Yln=o w large positive g such that the following holds: 

1) For any I € F, (F)i > (|f|)iminiw. 

2) For any I € T, (w)i < Qmini w. 

3) For any I € T , a/ = bj, where these are martingale differences coefficients for f and g. 

4) 9 ■ /*GJo:g(x)<0 w dx > c Q log p Q f Io F dx, p < l. 

6. Controlled doubling martingales 

We are going to make a small modification in the proof to get the following 

Theorem 6.1. Given Q > 1 there exist three 77 -martingales F,f, w, F > 0, w > 0, and one 
G-martingale g = Yln=o 9n such that the following holds: 

1) For any I € T, (F)i > (|f|)iminiw. 

2) For any I € F, (w)i < Qmini w. 

3) For I G F, bj = —aj, where these are martingale differences coefficients for f and g. 

4) For large positive number g, g • f xeIo : S ( x )> g w dx > c Q log p Q / Io F dx, p <\- 

5) For any two four-adic neighbors (neighbors in the tree) I € F and I, (w)j < 4(w)i. 

In other words, we can always control the four-adic doubling property of w. 

7. Remodeling by proliferation. The amplification of martingale differences 

Now we are going to repeat the procedure from [2j. We say that Iq supervises itself. Take 
a very large ni, consider the division of of Iq to 4 ni small equal intervals and let the leftest 
quarter of the supervisor (7 q it is) supervises the first, fifth, etc small subdivision interval of 
the supervisee (which is still Iq for now, so these are intervals of our just done subdivision). 
Let the second quarter supervises the second, the sixth, etc; the third quarter supervises the 
third, the seventh, etc, and the fourth quarter supervises the fourth, the eighth, etc. 

Now we have new pairs of (supervisor, supervisee). Subdivide each supervisor to its 4 sons 
and its supervisee to 4 n2 sons, where ri 2 » n\. Repeat supervisor/supervisee allocation 
procedure as before, Continue with the new pairs of (supervisor, supervisee). Repeat N 
times. 

Now, as in [2], we are going to “remodel” martingales w,F,f, g to new functions with 
basically the same distributions. 

We first “square sine” and “square cosine” function for any supervisee interval /. Let 

sqsin Io (x) := H Io (4 ni x), sqcos Io (x) = G 4 n 1Io ( x ). 

Next supervisors will be the quarters of I o- Take on of such quarter, say, /, and put 

sqsini(x ) := 77/ (4 n2 x), sqcosi(x ) = G 4 n 2 i 0 (x). 

We continue doing that for the next generation of supervisors. Let 7, J be a supervi¬ 
sor/supervisee pair. Now let £jj be a natural linear map J —>• 7. We put sqsj := sqsinjoijj , 
sqcj = sqcosi o £jj. 

Now we basically want to put 

N 

W := w + EE E cisqsj , 

n= 1 1 n J supervised by I 
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where cj are coefficients of w. 

N 

<S>:=F + Y^ d l S( l s Ji 

n= 1 n J supervised by I 

where dj are coefficients of F. 

N 

<t> ••= f+^2 ^2 ^2 a i s Q s J , 

n— 1 l£j r ,£(I)=4~ n J supervised by I 

where aj are coefficients of f. 

N 

p-=22 J2 J2 b isqcj, 

n= 1 l£J r ,l(I)= 4 —n J supervised by I 

where bj = —aj are coefficients of g. Notice that the last formula has square cosines sqc, and 
this will be important. 

We do exactly that, but to ensure the doubling property of W we just for every pair (I, J ) 
(supervisor/supervisee) replace sqsj by basically the same function, but such that its first 4 
steps on the left are replaced by 0 and its first 4 steps on the right are replaced by zero. Call 
it sqsmj. So 

N 

W := w + EE E cjsqsmj , 

72=1 /G^ r ,^(/)=4 —n J supervised by I 

where cj are coefficients of w. 

The doubling property of such a W has been checked in [2]. We notice that if 1 « n\ « 
n 2 << ... << 77.7V then the distribution functions of these new function are basically the same 
that for their model martingales. So we can repeat Theorem l6.ll Let us consider the periodic 
extension of W, <J>, </>, p to the whole line (or we could consider everything just on the unit 
circle identifying it with [0,1)). 

Theorem 7.1. Given Q > 1 then the above functions W,Q,(f>,p are such that the following 
holds: 

1) For any J G F, (<L)j > (\<f)\)jmmjW. 

2) For any J € F, (W)j < Q min j W. 

3) For any J € T, bj = —aj, these are martingale differences coefficients for cj) and p. 

4) For a large positive number g, g ■ j' xGJo . p ( x )> g Wdx>c 0 Q log p Q f Io $dx, p < 

5) W is doubling with an absolute constant. 

Now what happens with the Hilbert transform Ftf of <fp. It is immediate that if we extend 
periodically scsinj 0 to the real line and do the same with sqcosj 0 and call them sqsin, sqcos, 
then 

(7.1) H(sqsin)(x ) = £(x) sqcos(x ), 

where £ is a non-negative 1-periodic function that looks as follows. It is logarithmically goes 
to Too at 0, at , at ^T and at 1. It has two zeros: at and at |. Continue it 1-periodically 
Let I be one of the supervisors of k -th generation. Put 

fl(x) := £(4 nfc x), x € I. 
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Let now I, J is the the pair of supervisor/supervisee. Recall that Ijj is the linear map from 
J to I sending the left (right) end-point to the left (right) end-point. We put 

ij ■= 6 ° t-u- 

It is now tempting (looking at the definition of 4>) to write that (recall that bj = —aj) 

N 

H(f>{x ) = EE E bi£,j(x)sqcj(x ). 

71=1 l£j r ,£(I)= 4 —n J supervisedby I 

Unfortunately, in (17.11) we have 1-periodic sqsin, sqcos and not their localized to Jo’s ver¬ 
sions. But H of any bounded highly oscillating function on interval J goes to zero uniformly 
outside the neighborhood of the end-points of J. Therefore we can make up for the problem 
with localized to J functions sqsj , sqcj by writing 

N 

(7.2) H<j)(x ) = EE E biO{x)sqcj(x ) + 0(x), 

n =1 7g^ 7 ,^(/)=4 —n J supervisedby I 

where Q(x) is as close to zero as we wish on a set of as small Lebesgue measure as we wish-by 
the choice of largeness of ni << tt -2 << • • • ■ 

We can think that in all our constructions all sums are finite. In particular, the coefficients 
aj,bj = —aj of f, g can be thought to be zero after a while. So let mo be the last generation 
where we have these coefficients non-zero. Then the set 

(7.3) u :={x e [0,1] : g(x) > g} 

consists of the collection of the whole intervals of the next generation mo + 1, that is consists 
of the certain sons of certain collection of Tadic intervals of generation mo whom we will call 
X. The set of their sons forming oj will be called X. Intervals I from X are the last intervals 
that are supervisee in the remodeling construction above. Their supervised intervals will be 
called collection J. Let I S X, J € X are supervisee/supervised pair. We do remodeling last 
time: divide / to its sons, divide J to 4 nm o equal intervals, make correspondence between the 
sons of I and some small intervals of this subdivision of J. Let the son of I happen to be 
in X, then we mark correspondent small intervals J' of this subdivision of J by red. All red 
intervals will be called collection J. Call it Q := U j’ejJ'- 
Now we can see that 

(7.4) Q = U = {x € [0,1] : p(x) > g}. 

In fact, let I' be an element of X, and J' be a corresponding red interval (from J). Fix 
any point i£l and consider g = biGj at x. Consider the sequence of the terms of this 
sum. Here the sum has a term bpGj,(x) from father I' of X, then a term from a grandfather, 
et cetera. And for any x € I' this sequence we just described is the same. If we consider now 
any y G J' and consider the sequence of terms in the sum p(y) = Y^j-j supervisedby I bisqcj(y) 
we will see that it is exactly the same sequence as for x € X. This was done by remodeling 
construction because each J" that gives the contribution to the sum at y has a supervisor I" 
that gives the same contribution to the sum at x. This proves (17.41) . This proves that g and 
p are distributed in the same way (with respect to Lebesgue measure, and also with respect 
to pair w, W correspondingly). 
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However, we need a subtler thing. The distribution of p is not enough for us, we need also 
the distribution of 


N 

H<Ky) ■= E E E bi£j(y)sqcj{y). 

n=1 l£j r ,£(I)= 4 —n J supervised by I 


The problem is of course that we have all these £j(y). In fact, if I' € X, the for any red 
interval J' supervised by I' (and any point y in any such J') we have one and the same 
sequence of numbers {bisqcj{y)} riz i, j issup ervisedb y I■ 

Call this sequence of numbers d(x). It is a finite sequence {d \,..., d mo } and if x € I' C u, 
then (see (17.31) 1 

di + ... d mo =: gi > g . 

We normalize by 6i := di/g\. Then in the corresponding y € J' we have the sum for Hf(y) 
which looks like 

mo 

i =1 

where Xi(y) is a corresponding sqcj(y), for example aq (y) = sqcj(y), where J is a father of 
J' and a supervisee of a father I of V. 

We have to notice that the sequence d[x) does not depend on x, depends only on I' € X, 
and hence the sequence {9 1 ,..., 9 mo } does not depend on y as long as y € J', and J' is a red 
interval corresponding to I'. 

But unfortunately £j(y) depend on y very much. In different red intervals J[, J 2 ,... cor¬ 
responding to the same I' the sequence {aq,... ,x mo } is completely different. 

Fix our I' € X, let J(I') be the union of all red J' corresponding to I'. On Y := U 
we introduce the probability measure as follows: choose any such J' with equal probability 
V ', and then put a normalized Lebesgue measure on it. 

Notice that the joint distribution of {aq (y),... ,x mo (y)}, y £7, with respect to this V' is 
almost the same as the joint distribution of independent random variables {£ 1 ,... Cm 0 } having 
the same distribution of our function £ on [0,1]. We can make closeness in joint distribution 
apparent by choosing very large n\ « 712 << .... 

Consider now two cases: 1) < On 2) ^ 9/ > co, where Co is a certain absolute 

constant. 

Let £ = C k = ik ~ C = @kCk- Let us think that f £ = 1 Notice that then by 

normalization of 0i we have IE(^£fc) = 1. 

Case 1). X*{£ < 1/2} = X{|£ — 1| > 1/2} < 4 Var{(/) < 4co- So if cq is happened to be 
= 1/8 we get that 

X{£ > 1/2} > 1/2. 

Then by the closeness in joint distribution we would conclude that 

m 0 

(7.5) V{J20iXi(v)> 1/2} >1/4. 

i =1 


Case 2). In This case the sum of variations of $&£& is sufficiently large. Now we will use 
then the following lemma: 

Lemma 7.2. Let 9k > 0, k = 0,..., m— 1. Let £& be WL-valued independent random variables 
with variation Ok satisfying 

(7.6) 


E|£*| p <Cp9 p k /2 ,p = 3A,... 
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Then there exists 5 = 5(C,c) > 0 such that 


(7.7) 


V 


m— 1 


k =0 


E^+ a M 


m —1 x 1/2 
— ,2 
k 


k =0 


> 5 /or a// a € M. 


Remark. Notice that function £ € BMO, so by John-Nirenberg inequality the requirement 
(17.61) hold for our 

h,k ■ @kfk- 

We will apply this Lemma to such and to a = 0. Notice that our £*. will be non-negative. 


Proof. Denote 


Take A > 0 and consider 


|Ee iAcr | = 


m —1 


a = 


E £fe+°> Cfc = It— 


fc =0 


m—1 


„zAa 


J Ee* A5fc = ]^[ | Ei 


fc=0 


m—1 


m—1 


D ^A£fc | — 


m* 


MO- 1 


/c=0 


k =0 


Note now that for A < 9 k (our A below will be such) we have by (17.61) 


|<fe : 


^ACfcI — 


l-^Var£ k + 0(\ 3 6l) 


A 


< exp ( - —Var^ k + CX 3 0f. ), 


and 


m— 1 


[ \Se iX< * k | < exp (—cA 2 E^ 2 + c<a3 E 1 


fc =0 


<exp(-c'|A(£ 9 i) 1/2 '- + C'h(£ 


1 / 2 ' 


1 3 


Now choose 


Then 




A = '0? 


|<£’e lAcr | < exp ( - 

On the other hand, for every <5 > 0, one has 


- 1/2 


V13 


8 ( C ') 2 


|Ee''" - 1 <Ai(j/«j) 1/2 + 2-p||(r| > ■* (XX' 

<^i +2 p{H >«($;«?' 1/11 


Hence, 


p{m>«(E‘ 

if 5 is chosen small enough. 


1/2 ) 1 

> - 

“ 2 


1 — exp — 


(c') 


,/\3 


8(C') 


2 a 


>5, 


□ 
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The terms of the sum JT 9 iXi(y) are almost constant functions on each red interval J' € X'. 
We already proved in (17.51) . (17.71) that probability V of the sum JT Okt,k is larger than certain 
fixed absolute 5 is at least 5. Therefore we may think that at least 5/2 portion of red intervals 
J' E X' are such that for the sum 9iXi we have 

min 6,x l > 5/2. 

i 

Denote this collection of J' by symbol C(I'). 

Now the previous inequality translates into 

£ 

H4>(x) > -g, 

on all J' from the portions C(I') described above for all intervals I' E X. 

Now use 4) of Theorem 17. II The estimate in 4) p{x) > g holds on all red J' corresponding 
to any I' E X (see (17. 41) 1. The IT-measure of the union of them is large as indicated in 4), 
namely, > ^Q\og p Q f \4>(x)W(x) dx. 

Notice that all red intervals J' from X' have the same W measure (by construction of W). 
Therefore, the W-measure of all these portions of red intervals described above (portions 
are enumerated by I' E X) is at least 5/A times cQ\og v Q f \<j>(x)\W{x) dx. So on such W- 
measure we have Hcf>(x) > | g. This is exactly what we need if we take into consideration 
that 0(x) in (|7.2D can be taken as small as we wish outside the set of Lebesgue measure (and 
then obviously of Wdx measure as well) as small as we wish. 
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